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Abstract 

We present new results concerning threshold functions for a wide family of ran- 
dom intersection graphs. To this end we apply the coupling method used for estab- 
lishing threshold functions for homogeneous random intersection graphs introduced 
by Karohski, Scheinerman, and Singer-Cohen. In the case of inhomogeneous ran- 
dom intersection graphs the method has to be considerably modified and extended. 
By means of the altered method we are able to establish threshold functions for 
a general random intersection graph for such properties as ^-connectivity, match- 
ing containment or hamiltonicity. Moreover using the new approach we manage to 
sharpen the best known results concerning homogeneous random intersection graph. 

1 Introduction 

The first random intersection graph model was introduced by Karohski, Scheinerman, 
and Singer-Cohen [IT]. Since than it has been attracting attention mainly because of 
its wide applications, for example: "gate matrix layout" for VLSI design (see e.g. [llj), 
cluster analysis and classification (see e.g. [9]), analysis of complex networks (see e.g. 
[TJ ?]), secure wireless networks (see e.g. [2]) and epidemics (|6]). Several generalisations 
of the model has been proposed, mainly in order to adapt it to use in some particular 
purpose. In this paper we consider the Q (n, m,p) model studied for example in [H [51IT2]. 
Alternative ways of generalizing the model defined in jTl] are given for example in [7] and 

i- 

In a random intersection graph Q {n, m,p) there is a set of n vertices V = {vi, . . . , 
an auxiliary set of m = m{n) features W = {wi, . . . ,Wm{n)}, and a vector p{n) = 
(pi, . . . ,Pm(n)) such that Pi G (0; 1), for each 1 < i < m. Each vertex v E V adds 
a feature Wi E W to its feature set W{v) with probability pi independently of all other 
properties and features. Any two vertices v,v' eV are connected by an edge in Q (n, m, p) 
if W{v) and W{v') intersect. If p{n) = {p, . . . ,p) for some p G (0; 1) then Q {71,171, p) is a 
random intersection graph defined in |Tl]. We denote it by Q {n,m,p). 

The random intersection graph model is very flexible and its properties change a lot if 
we alter the parameters. For example Q {n,m,p) for some ranges of parameters behaves 
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similarly to a random graph with independent edges (see [H [13]) but in some cases it 
exhibit large dependencies between edge appearance (see for example [HI [15]). It was 
proved in [H] that in both cases Q {n,m,p) may be coupled with a random graph with 
independent edges so that with probability tending to 1 as n — )■ oo, Q{n,m,p) is an 
overgraph of a graph with independent edges. It is also explained how this coupling may 
be used to obtain sharp results on threshold functions for Q {n,m,p). Such properties 
as connectivity, a Hamilton cycle containment or a matching containment are given as 
examples. In general, the coupling technique provides a very elegant method to get bounds 
on threshold functions for random intersection graphs for a large class of properties. 

However the proof presented in pjj cannot be straightforward generalised to Q (n, m, p) 
with arbitrary p{n). First of all it differentiates between cases np — )■ and np — )■ oo. 
Moreover the method does not give sharp results for np tending to a constant. In this 
article we modify and extend the techniques used in [H] in order to overcome these 
constraints. First of all, to get the general result, we couple Q {n,m,p) with an auxiliary 
random graph which does not have fully independent edges. Therefore we need to prove 
some additional facts about the auxiliary random graph. Moreover we need sharp bounds 
on the minimum degree threshold function for Q {n,m,p). Due to edge dependencies, 
estimation of moments of the random variable counting vertices with a given degree 
in Q {n,m,p) is complicated. Therefore we suggest a different approach to resolve the 
problem. We divide Q {n,m,p) into subgraphs so that the solution of a coupon collector 
problem combined with the method of moments provide the answer. This new approach to 
the coupling method allow us to obtain better results on threshold functions for Q (n, m, p) 
and by this means resolve open problems left over in P3]. 

Concluding, we provide a general method to establish bounds on threshold functions 
for many properties for Q (n, m,p). By means of the method we are able to obtain sharp 
thresholds for /c-connectivity, perfect matching containment and hamiltonicity for the 
general model. Last but not least we considerably improve known results concerning 
Q {n,m,p). 

All limits in the paper are taken as n — )■ oo. Throughout the paper we use standard 
asymptotic notation o(-), O(-), ©(■)) ~) ^) ^ defined as in [10]. By Bin(n,p) 
and Po (A) we denote the binomial distribution with parameters n, p and the Poisson 
distribution with expected value A, respectively. We also use the phrase "with high 
probability" to say with probability tending to one as n tends to infinity. All inequalities 
hold for n large enough. 

2 Main Results 

In the article we compare random intersection graph Q {n,m,p) with a sum of a random 
graph with independent edges 6*2(^,^2) and a random graph 6*3(^,^3) constructed on the 
base of a random 3-uniform hypergraph with independent hyperedges. Generally, for any 
p = p{n) G [0; 1] and i = 2, . . . , n, let Hi{n,p) be an i-uniform hypergraph with the vertex 
set V in which each i-element subset of V is added to the hyperedge set independently 
with probability p. Gi{n,p) is a graph with the vertex set V and an edge set consisting 
of those two element subsets of V which are subsets of at least one hyperedge of Hi{n,p). 
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We consider monotone graph properties of random graphs. For the family Q of all 
graphs with the vertex set V, we call A Q a property if it is closed under isomorphism. 
Moreover A is increasing ii G E A implies G' E A for all G' E Q such that E{G) C E{G') 
and decreasing if Q\A is increasing. Increasing properties are for example: /^-connectivity, 
containing a perfect matching and containing a Hamilton cycle. 

Let p = {pi, . . . ,Pm) be such that pi G (0, 1), for all 1 < z < m. Define 

m 

2np,, 

^1'* = EH J^^^l-P^)""*' fort = 2, 3, 



m 



i=l 



n-1 



The following theorem is an extension of the result obtained in [T 

Theorem 1. Let Si, S2 and S3 be given by ([T]). For some function cu tending to infinity 
let 



P 



2^-2 



S2 — C0VS2 — 25*2^ 
2(f) 



for S3 > and < Ss/^ST 



(2) ^^--| ..-.v^-^25^-^ ^ /or53 = 0(v/5r); 



P3 



for S3 ^ v5i and u <^ 53/ ySi; 



0, forS3 = 0{^/S;). 

If Si ^ 00 and Si = (n^) then for any increasing property A. 

(3) lim inf Pr {G2 {n, p) G A} < lim sup Pr {Q (n, m, p) G A} , 

(4) lim inf Pr {G2 iji, P2) U 6*3 {n, P3) G A} < lim sup Pr {Q (n, m, p) G A} . 

Remark 1. Assumption 5*1 — )■ 00 is natural since for 5*1 = o(l) with high probability 
Q {n, m,p) is an empty graph. 

Remark 2. ^3 is the expected number of edges in G3 {n,p3). If S3 = 0{^/Sl) then by 
Markov's inequality with high probability the number of edges in G3 {n,p3) is at most 
co^/Sl. Thus S2 = Si — S3 = Si + 0{u^/Sl) and the bound provided by is as good as 
the one taking into consideration the edges from G3 {n,p3). 
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Remark 3. Theorem is also valid for Si = f2(n^) but with 

5*2 — UJy/ S2 



P 



P2 



P3 




for ^3 ^ \/Si and u ^ S^/ y/Si; 
for ^3 = O(v^); 

for 5*3 > and lu <^ S3/ a/57; 
for ^3 = O(v^). 



Denote by C^, VM. and l-iC the following graph properties: a graph is ^-connected, 
has a perfect matching and has a Hamilton cycle, respectively. We will use Theorem [1] 
to establish threshold functions for C^, VM. and 1-iC in Q {n,m,p). By Ck we denote 
here vertex connectivity. From the proof it follows that the threshold function for edge 
connectivity is the same as this for Ck- 

For any sequence c„ with limit we write 

for Cn — —00; 

(5) f{cn) = { e""""" for c„ -> c G {-00; 00); 

for Cn — 00. 

Theorem 2. Let maxi<j<mPi = o{{lnn)^^) and Si and be given by ([1]). 
(i) If Si = n{\ian + Cn), then 

lim Pr {g {n,m,p) e Ci} = /(c„), 

n—^cx3 

where f{cn) is given by ([5]). 

CI 

(a) Let k be a positive integer and an = -k^. If Si = n(\nn + (A; — 1) Inlnn + c„), then 




lim Pr {Q {n, m, p) G Ck\ 



for Cn — —00 and an a E (0; 1]; 

1 for Cn — 00. 



Assumption maxi<j<mPi = o((ln?T,)~^) is necessary to avoid awkward cases. The 
problem is explained in more detail in Section |H A straightforward corollary of the above 
theorem is that for Si = ?T,(lnn + c„), c„ — > —00 and any k = 1,2, . . . ,n. 

lim Pr {Q {n, m,p) G Ck] = and lim Pr {Q {n, m,p) G l-iC} = 0. 

Theorem 3. Let maxi<j<mPt = o((lnn)^^) and Si be given by ([1]). If Si = n(lnn + Cn) 

then 

lim FT{g{2n,m,p{2n)) G PM} = /(c2„), 

n—>-co 

where /(■) is given by ([5]). 
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Theorem 4. Let maxi<j<mPi = o((lnn) Si and Si^2 be given by ([T]) and an = 
If Si = n(ln n + In In n + c„) , then 



lim Pr{^ (n,'m,p) G T^C} 



/or c„ —7- — oo and a.^ a E (0; 1] 

1 for Cn — oo. 



Already simple corollaries of Theorems |2]-H] give sharp threshold functions for Q {n, m,p). 
For example. 

Corollary 1. Let m ^ n and p{l — {1 — p)^~^) = (Inn + Cn)/m. Then 

lim Pt {g {n,m,p) e Ci} = /(c„) 

n—>OD 

and 

lim FT{g{2n,m,p) E VM} = /(c2„), 

n— >oo 

where /(■) zs g'Zi'en 6y ([5]). 

In particular we may state the following extension of the result from [T7] . 

Corollary 2. Lei 6„ &e a sequence, /3 anc? 7 &e constants such that /37(1 — e~'^) = 1. // 

m = /3n\nn and p = — ( 1 + r—^] then 

n \ inn J 

f ( e"^7 
lim Pr (n, m, G Ci} = / 1 + 

n^oo \ \ 1 — e ' 



e '''7 



and 

lim Pr{^(2n,m,p(2n)) G = / ( ( 1 + — — — ] fogn 

n-5>oo y y I — e ^ J 

where /(■) zs g'zt;en by ([5]). 

Sometimes the method of the proof enable to improve the best known results concern- 
ing Q (n, m,p) even more. 

Theorem 5. Let m 3> In^ n and 



Inn + In fmax |l,ln 

(6) p{i - {1 - pr-') = ^ — ^ ^ 



m 
Then 

lim Pr {Q (n, m, p) G HC} = 



/or c„ -00; 

1 /or c„ — > 00. 
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Theorem 6. Let m ^ In^n and k be a positive integer. If 



Inn + In j max < 1, (np)^ ^ j ( 
p(l - (1 - p)"-') = ^ ^ ^ 



X fc-1 

e "P In n \ , e "P\nn 



+ Cn 



then 

lim Pr {Q (n, m, p) G Ck} 



for Cn — — oo; 
for Cn — c>o. 



One of the question posed in [H] concerned the range of m = m{n) for which the 
threshold function for for Q (n, m, p) coincides with this for 5{Q {n, m,p)) > 1. Moreover 
we may ask when threshold function for Ck for Q {n,m,p) is the same as this for Ck for 
G2 {n,p) with p = mp^. Theorem [6] gives a final answer to these questions. 

Corollary 3. Let k he a positive integer. If 

lnri+c„ £ 1ti2 nlnn 



p{\ - (1 -p) 

then 



n-l^ _ ; for In n <^ m <^ ^ 

In n+(fc-i) In In n+c„ ^ /or m = fi(n lu n) ; 



lim Pr {Q (n, m, p) G C^} 



for Cn -00; 
for Cn -> 00. 



The proof is divided as follows. Section 3 describes the coupling used to establish 
threshold functions and presents the proof of Theorem [H In Section 4 we give minimum 
degree thresholds for Q {n,m,p). Section 5 is dedicated to the properties of the auxiliary 
random graphs used in the coupling established in Theorem [1] The proofs of the remaining 
theorems are presented in Section 6. 



3 Coupling 

In this section we present a proof of Theorem [H In the proof we use auxiliary random 
graph models G*j (n, M), i = 2,3, ... ,n, in which M is a random variable with non- 
negative integer values. For i = 2, . . . , n, G*i {n, M) is constructed on the basis of a 
random hypergraph ]HI*j(?T,, M). ]HI=i,j(n, M) is a random hypergraph with the vertex set V 
in which the hyperedge set is constructed by sampling M times with repetition elements 
from the set of all z-element subsets of V (all sets which are chosen several times are added 
only once to the hyperedge set). G^kj (n, M) is a graph with the vertex set V in which 
f , f ' G V are connected by an edge if {f , v'} is contained in at least one of the hyperedges 
of IH[^j(n,M). If M equals a constant t with probability one or has the Poisson distri- 
bution, we write G^j (n, t) or G=i,j (n, Po (■)), respectively. Recall that similarly Gi{n,p) is 
constructed on the basis of Hi{n,p) - a hypergraph with independent hyperedges. 

In this paper we treat random graphs as random variables. By a coupling (Gi,G2) 
of two random variables Gi and G2 we mean a choice of a probability space on which a 
random vector (G'^,G2) is defined and G'^^ and Gg have the same distributions as Gi and 
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G2, respectively. For simplicity of notation we will not differentiate between (G'^, Gg) and 
(Gi, G2). For two graph valued random variables Gi and G2 we write 

Gi ^ G2 or Gi ^i-a(i) G2, 

if there exists a coupling (Gi,G2), such that in the probability space of the coupling Gi 
is a subgraph of G2 with probability 1 or 1 — o(l), respectively. Moreover, we write 

Gi = G2, 

if Gi and G2 have the same probability distribution (equivalently there exists a coupling 
(Gi, G2) such that Gi = G2 with probability one). 

Note that, for any A, in EI*j(n, Po (A)) each edge appears independently with proba- 
bility 1 - exp(-A/(f)) (see [8]). Thus 

(7) G,i(n,Po(A)) = G, (^n,l-exp ^-a/ 

We gather here a few useful facts concerning couplings of random graphs. For proofs 
see [131 E]. 

Fact 1. Let Mn be a sequence of random variables and let tn be a sequence of numbers, 
(i) If Pi {Mn > tn} = 0(1) then G^ (n, M„) ^i-o(i) (n,t„) . 
(li) If Pi {Mn < tn} = 0(1) then G*i ^i-o(i) (n, M„) . 

Fact 2. Let {Gi)i=i,...^m 0''nd (G9j=i^..._m be sequences of independent random graphs. If 

^ G'j, for all i = 1, . . . ,m, 

then 

m m 
i=l i=l 

Fact 3. Let Gi = Gi(n), G2 = G2(ri), and G3 = G^in) be random graphs. If 

^i-o(i) G2 and G2 ^i-o(i) 

then 

Gi ^i-o(i) G3. 

Fact 4. Let Gi = Gi(n) and G2 = Gt2{f^) be two random graphs, such that 

(8) Gi ^ G2 or Gi ^i-o(i) G2. 
Then for any increasing property A 

lim inf Pr {Gi {n) e A} < lim sup Pr {G2 {n) G A} . 
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Proof. Define event S := {Gi C G2} on the probability space of tlie coupling (Gi,G2) 
existing by (|8]). Then for any increasing property A 



Pr {G2 eA}>PT {G2 G A\£} Pt{S} 

> Pr{Gi e A\S}Pt{S} 
= Pt {{Gi e A} n S} 

= Pr {Gi G ^} + Pr {£} - Pr {{Gi e A} U S} 

> Pr{Gi G ^} + Pr{^} - 1 
= Pr{Gi eA} + o{l). 



The result follows by taking n — i- 00 □ 

Proof of Theorem U\ We will show only (jl]) in the case 5*3 ^ -\/>S7- The remaining cases 
follow by similar arguments. Here we should note that S2 = Si — and S2 = 0(5'i). 

Let Wi G W. Denote by Vi the set of vertices which have chosen feature Wi (i.e. Vi = 
{veV:w,e W{v)}). Let 



(9) v: 

For each 1 < i < m define 



for \Vi\ > 2; 
otherwise. 



X, = \V,\; 

(10) Y, = \V:\; 

Zi = I{Yi is odd}, 

where is an indicator random variable of the event A. Note that Xj, 1 < i < m, are 
independent random variables with binomial distributions Bin(n,pj), 1 < i < m. Now 
let 

l<i<m l<i<m 

Let G[V-] be a graph with the vertex set V and the edge set containing those edges from 
Q [n, m,p) which have both ends in V- (i.e. its edges form a clique with the vertex set V-). 
For each 1 < i < m, we construct independently a coupling of G=i,2 {n, ^'~^^' ) UG=„3 (n, Zj) 
and Q[V-]. Given Yi = i/i and Zi = Zi, for each i independently, we generate instances of 
G*2 {n, ^'~^^' ) and G*3 (n, zi). Let F/ = ?/• be the number of non-isolated vertices in the 
constructed instance of G*2 {n, ^'~^^' ) U G*3 (n, By definition ?/• < l/j. Set now V- to 
be a union of the set of non-isolated vertices of G*2 {n, ^'~^^' ) U G*3 (n, Zi) and yi — y[ 
vertices chosen uniformly at random from the remaining ones. This coupling implies 



«*2 



^,^^^)UG,3 {n,Z{)^g[V,]. 
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Graphs G,2 {n, Y^^) uG,3 (n, Z^), 1 < i < m, are independent, and Q[Vi\, 1 < i < m, 
are independent. Therefore by Fact [2] and the definition of Q {n,m,p), we have 



(11) 



(n, M2) U G,3 (n, Ms) = |J (G*2 (n, ^i^) U G*3 {n, Z,)) 

l<i<m 

^ U ^[^^] 

l<i<m 

= ^ {n,m,p) . 



By definition 

mm mm 

E^r, = Ej](Xi-Ix,=i) = ^i and E ^ = E ^(Ix.i.odd - Ix,=i) = ^3 



i=l 

Moreover 



i=l 



Var 5^ = 5^ (VarX, + Varl^,=i - 2(EX,Ix,=i - EX,EIx,=i)) 

m 

< J2 (^^* - EI^.=i + 2EXiEIx,=i) 

i=l 
m 

= ^(Ey, + 2(np,)2(l-p,)""i) 



i=l 

m 



1=1 
<45i 



and 



VarJ^ = J2 (Varlx.odd + Varlx,=i - 2(EIx,oA=i - EIx,oddEIx,=i)) 



i=l i=l 
m 

< ^(EI 

Xi odd EIx.=i + 2EIx,oddEIx,= 

i=l 
m 

< J2 (E^* - Enx,=i + 2EX,EIx,=i) 

i=l 

<45i. 



Therefore by Chebyshev's inequahty, for any u', 1 ^ (1^')^ ^ •S's/^/Si, with high proba- 
bihty 



(12) 



Pr 



Pr 



i=l 



i=l 
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Thus with probabihty 1 — o(l) 

M2>- ^- 

M3 > S3 - u'^i. 

Therefore by Fact [1] 

G*2 (n, M2) U G*3 (n, M3) ^ ^ (n, m, p) . 

We may assume that in the above couphng G=i.2 (^n, '^i~3-5'3-4a; VST j g^^^^^ ^^^^ _ ,^'^5^^ 

are independent. The main reason for this is the fact that even though M2 and M3 are 
dependent (i.e. also G*2 {n,M2) and G*3 (n, M3) are dependent), the choice of a hyper- 
edge of EI^,j(n, ■) in a given draw in the construction of G^:2 {n, M2) and G=i,3(n, M3) is 
independent from choices in other draws. Moreover note that in the couphng, in order to 

U 



get G,2 (n, M2) U G,3 (n, M3) from a sum of independent graphs G,2 (n, Si-^^^-^' Vsl ^j 
G*3 (n, 5*3 — cj'a/ST) we may proceed in the following way. Given M2 = m2 and M3 = m^: 

- a 1712 ( or m3 resp. ) is larger than (5*1 — 35*3 — 4:Uj'y/Si)/2 { or S3 — u'^/Sl resp. ) 
then we make m2 — {Si — 35*3 — 4a;'\/S7)/2 (or m^, — {S-^ — oj'yfSl)) additional draws and 
add hyperedges to H,2(n, ( e,3(n, ^3 - uj'^/Sl) resp. ) 

- if m2 (m3 resp.) is smaller than (5*1 — 35*3 — Auj'y/Si)/2 ( or 6*3 — uo'^/Sl resp. ) then 
we delete from H,2(n, Si-^s^-^^'Vsl ^ ^ H*3(ra, ^3 - uj'\^i) resp. ) hypredges attributed 
to the last draws to get exactly mj, i = 2, 3, draws. 

Let M2 and M3 be random variables with the Poisson distribution 

Po ^-^ )l^2-^ and Po ^5*3 — 2a;' , respectively. 

Then by sharp concentration of the Poisson distribution 

Pr < '^^ ~ 4a;' I ^ ^ _ Pr < ^3 - V^} = 1 - o(l). 

Therefore by Fact [H Fact E] and ([7]) 

G2 In, 1 - exp I I I U G3 I n, 1 - exp I 

= G*2 (n, M2) U G,3 (n, M3) 

^i_o{i) G*2 fn, ^^^i — 36*3 4a;v^ \ ^ / ^ _ ^/^^r 



^i-o(i) Q {n,m,p) . 
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'5*^ _ 35*, — Sw'a/S'i 
J*2 ( Po ( — ) ) U G^,3 ( n, Po ( — 2cj'\/ 5*1 



For Si = o{n^) (then also 5*3 < = o{n^)), u = 5co', and p2 and ps as in ([2]) 
^5 ^^-^-^H 

— s — -'"^''''l — © 

Therefore using standard couphngs of G2 {n, ■) and H^{n, ■) finally we get 

G2 {n,p2) U ^3 in,p3) 

^ G2 I 1 - exp I ^^^j^ I I U 6-3 U, 1 - exp I 

■ 5c. 

2 

^i-o(i) ^ {n,m,p) . 

Therefore the result follows by Fact HI □ 

4 Vertex degrees in Q [n^m^p] 

For any graph G denote by 5{G) the minimum vertex degree in G. 
Lemma 1. Let maxi<j<mPi = o((lnn)~-'^) and Si he given by ([T]). 

Si = n(lnn + c„) 

then 

lim Fi{5{g{n,m,p)) > 1} = /(c„), 

where /(■) g'Zi'en 6y 

Note that the condition maxi<j<mPi = o((lnn)~^) is necessary. Otherwise the number of 
vertices of a given degree depends more on the fluctuations of the values of the vector p. 
For example let m = and p equals 



bn bn 1 1 \ /I /lnn + c„ /Inn + c^ 

Inn ' Inn' nm' ' nm/ I \^\nn' V nm ' ' V nm 



lnn{ln n+Cn)/bn 



for some 6„, c„ = o(lnn). In both cases 

m 

Y,np^ (1 - (1 - PiT'') = n{\ian + c„ + o(l)) 



1=1 
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but the expected number of vertices of degree in ^ {n,m,p) is 

(1 + o(l))exp |^-c„ - and (1 + o(l)) exp(-c„), 

respectively. 

Lemma 2. Let maxi<j<mPi = o(lnn"^), k be a positive integer, Si and Si^t, t = 2 
be given by ([1]) and, c„ — )■ oo. 



(^) If 



Si = n [ Inn + {k — 1) In [ max I 1, ( In n 



then 



Si 

lim Pr {S{Q {n, m, p)) > A;} = 



(n) If 



Si — Si^t = n ^In + (A; — 1) In ^max 1 1, (^~^ 



n 



then 



lim Pr {S{Q {n,m,p)) > A;} = 1. 



Here and in the proof we assume that Yl't=3 ^^,t ~ ^- -^^^ ^ the result of Lemma[2] 

may be improved. 

Lemma 3. Let k be a positive integer and m ^ In^ n. If 

Pd - (1 - P)"-) = ^ . 

m 

then 

for Cn — > — oo; 



lim Pr {S{Q (n, m,p)) > k} 



1 for Cn oo. 



Proof of LemmalJi In the proof we assume that c„ = o(lnn). Note that if p' = {p'^, . . . , p'^) 
and p = {pi, ■ . ■ ,Pm) are such that p'j < pi, for all 1 < z < m, then Q (n, m,p') ^ Q (n, m,p) 
and — (1 — p'i)"'~^) < ~ (1 ^Pi)"^^)- Therefore by monotonicity of 

the considered property, analysis of the case Cn = o(lnn) is enough to prove the general 
result stated above. 

Consider a coupon collector process in which in each draw one choose one coupon 
uniformly at random from V. In order to determine the minimum degree we establish a 
coupling of the coupon collector process on V and the construction of Q {n,m,p). Define 
V- and Yi as in (|9]) and ( TTOl) . We consider the process in which we collect coupons from V 
and at the same time we construct a family of sets {V- : i = 1, . . . ,m} (i.e. equivalently 
we construct an instance of Q {n,m,p)). Assume we have a given vector 

{Yi, Y2,..., Ym) = (yi, 2/2, • • • , ym) 
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chosen so that Yi are independent random variables with distribution of the random 
variables defined in (fTOj) . We divide the process of collecting coupons into m phases. In 
the i-th phase, 1 < i < m, we draw independently one by one vertices uniformly at 
random from V until within the phase we get t/i distinct vertices. Let V- be the set of 
vertices chosen in the i-th phase. We construct an instance of Q {n,m,p) by connecting 
by edges all pairs of vertices within V/, for all 1 < i < m. Obviously after the m-th phase 
Ui<i<m ^' non-isolated vertices in Q {11,171, p). Denote by Tj the number of 

vertices drawn in the z-th phase. U Yi = yi = then Tj = 0. If Yi = yi > 2 and in the 
i-th phase we have already collected j < yi vertices then the number of draws to collect 
the j + 1-st vertex has geometric distribution with parameter Therefore 

y^~^ n 1 (\^yi-^ 2i _ yiivi-l) for 9 < 7/. ^ r) /9- 



j=o ^~ 3 ~^ j [ n In n for y^ > n/2. 



Note that 



n 



p1 

Inn 



Thus 

n/2 



E(T, -Yi)<Y, ^^^^^ Vf (1 -p^r-''' + nInnPr \y, > ^} < 2np 



Therefore 



m 



i=l 

m f 1 ] 



J=l J i=\ 

m 



1=1 

< 



m ^ 



mini<i<„{p. ,n] .^^ _ 

«(ii)f:-p^ (i-(i-p.r-) 

^ ^ 1=1 



Inn^ 

where Si is defined as in ([1]). Moreover by Markov's inequality for any u 
Therefore with high probability 

(13) V(Ti - Y,) < , for any K < — -, 

jr^ a; Inn (In n E(2^.^-L(Ti - F^))) 
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Given K w < min{5i/(lnn E(^™^(Ti - Yi))); y/S^/\nn} let 

^1 



5*1 — uj\/ Si and = Si + coy' Si + 



ulnn 

In the probability space of the coupling described above define events: 

A- - all coupons are collected in at most T_ draws; 

A+ - all coupons are collected in at most draws; 

A - 6{Q (n, m,p)) > 1; 

B - the construction of Q {n, m, p) is finished between T_-th and T+-th draw; 

Bi - the construction of Q {n,m,p) is finished in at most X^I^i ^* + 5'i/(a; Inn); 

B2 - in ^ {n, m, p) we have Si — uj\/Si < YlT=i ^ (jJ\fSi- 

By definition 

^+ n c ^ n i3 c ^_ n i3. 

Therefore 

(14) Pr {A+} - Pr {Bl} < Pr {A} < Pr {A-} + Pr {B^} . 

Let Si = n(lnn + c„), then 

T± = n(lnn + Cn + o(l)). 
Therefore by the classical results on the coupon collector problem 

PrM_}~PrM+}~/(c„) 

and by ^ and ^ 

Pr {B^} = Pr {Bf UB^} = 1- o(l). 

Thus the lemma follows by substituting the above values to f|T4l) . □ 

The proofs of Lemmas [2] and [3] rely on a technique of dividing Q (n, m, p) into sub- 
graphs. We gather here some simple facts concerning this division. For each 2 < t < k, 
let Qt {n, m, p) be a random graph with a vertex set V and an edge set consisting of those 
edges from Q {n,m,p), which are contained in at least one of the sets {Vi : \Vi\ = t}. 
Moreover let Qk+i {n, m,p) be a subgraph of Q [n, m,p) containing only those edges which 
are subsets of at least one of the sets {Vi:|Vi|>A; + l}. Define 

m m k 

Mt = Y^ Iy,=t, for alH = 2, . . . , A;, and M^+i = - J] 

2=1 i=l t=2 

Note that, for all t = 2, . . . , fc, we have Qt {n, m, p) = G*^ (n, MJ, where (n, ■) is defined 
as in Section [31 Moreover 

EMt = ^ for all t = 2, . . . , A; - 1, and EMk+i = '^i - ^ Si^t, 

t=2 
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where 5*1 and Si^t are defined as in ([T]). For all 1 < t < A;, Mt is a sum of independent 
Bernoulli random variables and Si^t < •S'l. Therefore by Chebyshev's inequality, Markov's 
inequality, and by ( IT2|) for any w — )■ oo with high probability 

(15) max jo, ' ^^^ j < m, < + for alH = 2, 3, . . . , 



(16) M,+i > ^1 - 5^ S,,t -kcu^/sl 



t=2 



(17) Mfc+i + jMj - tMt <Si- Si^t + 2^^^^, for alH = 2, 3, . . . , fc. 

Moreover, for any t = 2, 3, . . . , /c, if Si^t — ?■ oo then with high probability 



t ~ t 

Set u a function tending slowly to infinity. In the proofs we will assume that u is small 
enough to get the needed bounds. 



For t = 2, . . . , A; let 



5*1 

Tt+ = Si - Si^t + 2wa/S\ ■ 



wlnn' 

r k 

T_ = max < 0, 5*1 — Si^t — kuj\/S 

t t=2 

Si^t-2ujJs;;t-t\Sl^n-' 



(19) t(^) 

Sli + 2u^fS^t 



Pt+ 



0, for Si^t = 0{uj^) 
t(j^ — ' ' otherwise. 

For all t = 2,...,k Qt{n,m,p) = G^:t {n, M2), thus by (fT5|) and (fT8|) . using the same 
methods as in the proof of Theorem [1] we can show the following fact. 

Fact 5. Let qt and pt± be defined as in f|T9l) . Then 

Gtin, qt) ^i-o(i) Qt {n, m, p) , 

where Gtin^qt) is independent from (Qj{n^m,p))j=2,...k+i,ji^t- 
Moreover if Si^t co then 

Gtin,pt-) ^i-o(i) Gtiri,m,p) ^i_o(i) Gtin,pt+). 

where Gt{n,pt±) is independent from (Qj {n,m,p))j=2,...k+i,jj^t- 
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As in the proof of Lemma [T] we will use the coupling of the coupon collector process 
on V and the construction of Q {n,m,p). However here, for some t = 2, . . . ,k, in the 
process we omit rounds in which Yi = t. Therefore we construct IJj=2 k+i-j^t^t {n,m,p) 
instead of Q {n,m,p). Reasoning in the same way as in the proof of Lemma [1], by the 
definition of Mf, (IT^ . and ( 1T7|) we get that with high probability the construction of 
Uj=2 k+i-jj^t P) finished before the Tj+-th draw. Similarly, if in the process 

we omit rounds with < k, then with high probability we construct Qk+i {n,m,p) in at 
least T_ draws. Therefore analogous reasoning as this used in the proof of Lemma [1] gives 
the following facts. 

Fact 6. There exists a coupling of the coupon collector process with the set of coupons 
V and the construction o/ IJj=2 k+i-j^t^j i''^^^^P) ■^'wc/i that with high probability the 
number of collected coupons in Tt+ draws is at least the number of non-isolated vertices 

Fact 7. There exists a coupling of the coupon collector process with the set of coupons V 
and the construction ofQk+i (n, m, p) such that with high probability the number of collected 
coupons in T_ draws is at most the number of non-isolated vertices in Qk+i {n,m,p). 

Proof of Lemma\^ In the proof we use notation introduced in ( fT9l) . Moreover let a„ = 
81,2/ Si. 

(i) We restrict our attention to the case c„ = o(lnn). In the latter cases the result follows 
by Lemma [H 

First consider the case a„ = 5'i_2/'S'i ^ 1/lnri. Then S'i_2 00. Take any probability 
space on which we define the coupon collector process on V and G2 (n,p2+), in such a 
manner that they are independent. Let X+ be a random variable counting vertices which 
have not been collected during the coupon collector process in T2+ draws and have degree 
at most — 1 in G2 (^^5^2+)- If 

5*1 = n(ln n + [k — 1) ln(a„ In n) — c^) 
then for a; tending to infinity slowly enough 

-T2+ + {n-l)p2+ = - {si + o{uj^i + -f^ ) ) =lnn+(A:-l)ln(a„lnn)-c„ + o(l) 
n n \ \ u Inn J J 



and 



Therefore 



an In n 

P2+ - 



n 



n—k 



, i=0 



~n— TTri'^n Inn)^ ^ exp(— Inn — (A; — 1) ln(a„ Inn) — c„ 
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and 



EX,(X, - 1) (l - 1)^" (" ,^ ')pU(1 - ft 




-r \ 2 



Thus by the second moment method with high probabihty X+ > as c„ — )■ — oo. Note 
that if there is a vertex which is isolated in lJt=3 Gt i'n, m,p) and has degree at most k — 1 
in Q2 {n,m,p), then S{Q {n,m,p)) < k — 1. By Facts [6] and [5] there is a probabihty space 
such that with high probabihty Q2 {n,m,p) C G2 {n,p2+), the number of isolated vertices 
in lJt=3 (n, m, p) is at least the number of non-collected coupons after T2+ draws, and 
G2 {n,p2+) and the coupon collector process are independent. Thus X+ > imply that 
with high probability 6{Q {n, m,p)) < k — 1. 

Now let S'i^2/'S'i = 0((lnr;,)~^). Then Si = n(ln?T, + c„ + 0(l)). Therefore for the result 
follows by Lemma [H 

(ii) In the proof we restrict our attention to the case c„ = 0(ln?T,). In the latter case the 
result follows after combining ([3]) with known results on G2 {n,p). 

Let a„ = S'i,2/'S'i ^ 1/lnn (thus 6*1,2 — ^ 00 and a„ — )■ 00). Consider any probability 
space on which we define the coupon collector process on V and G2 {n,p2-), in such 
a manner that they are independent. Let X_ be a random variable defined on this 
probability space and counting vertices which have not been collected during the coupon 
collector process with T_ draws and have degree at most A; — 1 in G2 {n,p2-). If 5*1 — 
Y^=d, "^i,* ~ n(lnn + (A; — 1) ln(an Inn) + Cn) then for oj tending to infinity slowly enough 

/ fc-i 

-T_ + (n - l)p2- >-\Si -ySi,t + O L^/S'^ + Sf 



t=3 




= Inn + (A; — 1) ln(a„ Inn) + c„ + o(l). 

Thus 

exp(— Inn — (A; — 1) ln(a„ Inn) — c„) = 0(1) 




(k-iy. 



Therefore with high probability X_ = 0, i.e. with high probability each vertex is collected 
in T_ draws or has degree at least k in G2 (n,p2-)- Note that if each vertex is non- isolated 
in Qk+i {n, m,p) or has degree at least k in Q2 {n, m,p), then 6{Q (n, m,p)) > k. Therefore 
Facts [5] and [7| imply that with high probability 6{Q {n,m,p)) > k. 
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Now let S'i,2/'S'i = 0((lnn) i.e. = 0{n). Then 5*1 — Ylt=2 "^i.* ~ n{lnn + Cn + 
0(1)) and ' 

T„ =n(lnn + c„ + 0(l)). 

Thus by Fact [7] with high probabihty there is no isolated vertex in Qk+i {n,m,p), i.e. with 
high probability 6{Q {n,m,p)) > k. □ 



e In n \ ^ ^ e "'^Inn 



Proof of Lemma\3i We use notation from (|T9|) . Moreover let 

«n = M'-' I ( ) + 

Note that 



1 - e-"P / 1 - e""P 



where 5*1 and 6*1 are defined as in ([T]). Therefore as far as np^ = o{l) 

inn H — -— lnn~a„. 



5*1 / 5*1 

If Si^t = 0{Si/ Inn) for all t then a„ = 0(1) and ^^Inn = 0(1). Therefore the lemma 
follows by Lemma [21 It remains to consider the case: S'12 ^ Si /Inn or Si^k ^ Si /Inn. 
Note that in this case if c„ = O(lnn) then np"^ = o(l) and a„ — )■ 00. 

Let now c„ — )■ — 00. As in the proof of Lemma [2] we may restrict our attention 
to Cn = o(lnn). If e~"^lnn./(l — e""^) = fi(l), then the lemma follows by Lemma El 
therefore assume that 

(20) ^--^ = 0(1) (i.e. a„ ~ M'^-^-nnn) . 

By fl20l) Si^jt — > 00, thus we may apply the second part of Fact [51 

Take any probability space on which we define independent coupon collector process 
on V and Gk{n,Pk+)- Let be a random variable counting vertices which have not been 
collected during the coupon collector process with Tfc+ draws and have degree at most 
— 1 in Gk{n,pk+)- For u tending to infinity slowly enough 



-Tyt++ = - Si + O wa/Si + Sin H = lnn + lna„ + c„ + o(l), 

n \k — \ n\ \ wmn'' 



thus 



EX,=n(l-l)'" ((l-p..)(-)+ (^:J)p..(l-R.)(-)-) 



~n-— — a„exp(— Inn — lna„ — c^) ~ 



(/t-i)! " " (fc-i; 
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and 



\k-2j 



(k-iy. 

Thus with high probabihty X+ > as c„ — )■ — oo (i.e. with high probabihty there is a 
vertex, which has degree at most A; — 1 in Gk{n,Pk+) or has not been collected in Tfc+ 
draws). Thus by Facts [5] and [6] with high probability 6{Q {n,m,p)) < k — 1. 

Now assume that c„ — )■ oo. As it is explained in the proof of Lemma |2] we may restrict 
our considerations to the CcLSG Cfi = O(lnn). Recall that we assume that Si^2 ^ 5*1/ Inn 
or Si^t ^ 5*1/ Inn, i.e. — )■ 00. Note that by definition 

< ^ = O (mp*e-"^) . 

Take any probability space on which we may define independent coupon collector process 
and Ut=2 ^*)- ^® ^ random variable counting vertices which have not been 

collected during the coupon collector process in T_ draws and have degree at most k — 1 
in IJt=2^t(^t)- Note that if v has degree at most A; — 1 in IJt=2^t(^0) then for some 
< ko < k — 1 and a sequence of integers r2, . . . , such that J2t=2i^ ~ ~ ^0 

(i) there is a set V C V of k^ vertices such that for each t there are hyperedges in 
Ht{qt) (generating Gt{qt)) contained in V U {v}. 

(ii) and all hyperedges in |Jt=2 Ht{qt) containing v are subsets of V U {f }. 
Let r = X]^=2^*' then for c„ = O(lnn) event (i) occurs with probability 



t=2 



0{l){npf\mpe-'''py 



1 - e~"P) 



r 



0(aO 



Moreover 



1=2^'' \ t=3 



Inn + lna„ + c„ + o(l). 
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Thus 

EX_ =n\^l--j 0(a„) - = o(l). 

Therefore with high probabihty X_ = 0. By Facts [5] and [7] with high probabihty 
6{Q {n, m,p)) > k. □ 

5 Structural properties of G2 {n,p2) U G3 {n,ps) 

For any graph G and any set S C y(G') denote by Nq the set of neighbours of vertices 
form S contained in V{G) \ S. For simphcity we write A''2(-) = ^G2{n,p2)(') ^3(") = 
NG2{n,p,)uG,{n,p^)i-). Note that |iV2(5)| < |iV3(^)|. 

We first show that G2 {n,p2) U G3 {n,p3) with high probabihty have several structural 
properties. 

Lemma 4. Let k and C be positive integers and 

9n\nn 1 

P2 + -P3 = for 9n^ > 

In Z 

and'-) > 1—6'. Then with high probability G 2 {n,p2)UG3 {n,p3) has the following properties 
(i) Bi - for all S CV such that n'^ < \S\ < 

\NsiS)\ > 2\S\. 

(a) B2 - for all S CV such that n'^ < \S\ < 

\N3{S)\ > min{|S'|,4nlnlnn/lnn} > min{|S'|, a(G2 {n,p2) U G3 (^,^3))}, 
where a{G2 {n,p2) U G3 (^^,^3)) is the stability number of G2 {n,p2) U G3 (^,^3). 

(Hi) Bs^k - for all S C V if 1 < \S\ <n^ and all vertices in S have degree at least Ak + 15 
in G2 {n,p2) U G3 {n,p3) we have 

\N3{S)\ > 2k\S\. 

(iv) Bi^c " ciny two vertices of degree at most C are at distance at least 6. 

(v) Br-, - contains a path of length at least 

^ 81n2^ 
Inn 
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Proof, (i) and (ii) By Chernoff's inequality (for the proof see for example Theorem 2.1 in 
[To] ) for any 6 = o{l) and any binomial random variable X we have 

Pr{X < 6EX} < exp{-EX{6\n6+l-5)) = exp(-(l + o(l))EX). 

First consider the case n'^ < \S\ < 4n In Inn/ Inn. Let s = \S\. Then |A^2('S')| has the 
binomial distribution Bin (n — s, 1 — (1 — and E|A'"2(S')| S> 215*1. Denote by X the 
number of sets S of cardinality n'^ < \S\ < 4nlnlnn/ln?T, such that |iV2(S')| < 215*1. Using 
Chernoff's inequality. 

4n In In n/ In n , ^ 

^ E ( J (-(1 + -s){i-{i- P2r)) 

n/ (In n. In In n) 

< exp (^s ^1 + In — j — (1 + o{l))snp2^ 

4rt In In n/ In n , 0' \ 

+ E -pHi+'"i)-"21^) 

s=n/(lnn In Inn) 
n/(ln n In In n) 

< exp(s(l + (1 -7)lnn- (l + o(l))^;inn)) 

4n In In n/ In n , , \ \ 

+ E -pK(i+°(i))'"'""-8(i:nir^) 

s=n/(lnn In Inn) 

= oil). 

Therefore with high probability for all sets 5* C V such that rC < \S\ < 4n In In n/ In tt, we 
have 

\N2{S)\>2\S\. 

Now consider the case IS"! > 4nlnlnn/lnn. Let r = 4nlnlnn/lnn. Let moreover 
Kr and Kr^r be the complements of the complete graph on r vertices and the complete 
bipartite graph with each set of bipartition of cardinality r. Denote by Xr and Xr^r the 
number of Kr and K^^r in G2 (^,^2) U G3 {n,f)^), respectively. Then 



and 



EX.= Q(1-^2)(^)(1-P3)(3H^)("-^^^ 

— exp (^"2 (P2 + {n- r)p3) + o(l) 
/ 6 In Ti \ ^ 



y exp y-^{P2 + {n- r)p3) + o{l] 



2r 

= o(l). 
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Therefore with high probabihty X^. = which imphes that with high probabihty 

a{G2 {n,p2) U (^,^3)) < r. 

Moreover with high probabihty Xr^r = thus with high probabihty for any 5 C V such 
that r < jS"! we have 

NsiS) > n - 1^1 - r. 

(Otherwise G2 {n,p2) U G3 (n,p3) would contain X^.r-) 

Therefore with high probabihty for any S CV such that r < \S\ < 2n/3 

Ns{S) > min{|5|, r} > mm{\S\,a{G2 {n,p2) U {n,ps))}. 

Moreover with high probabihty for any S CV such that r<|5'|<n/4we have 

NsiS) >n-\S\-r> 2\S\. 

This finishes the proof of (i) and (ii). 

(iii) For any two disjoint sets S* C V and S" C V wc denote by 6(5*; S') the number of edges 
in G2 {n,p2) U Ga {n,ps) with one end in S and one end in S' and by e{S) the number 
of edges in G2 {n,p2) U G3 (n,p3) with both ends in S. We will bound numbers e{S) and 
e{S,S') for \S\ < and S' C V \ S such that \S'\ = 0{\S\). Let ,5 C V, -S' C V \ S, 
\S\ = s, 1 < s < rf and \S'\ = 2ks. A pair {v,v'} {v & S and v' G 5") is an edge 
in G2 {n,p2) U G3 {rijp^) if one of the three following events occurs: {v,v'} is an edge in 
G2 {n,p2), there is a hyperedge {v,v',v"} with v" e V \ (S* U S') in H^ip^), there is a 
hyperedge {v, v' , v"} with v" e S\JS' \n Hs{p^). At least one of the two first events occur 
with probability at most P2 + (n — {2k + l)s)ps — 0{lnn/n) independently for all v & S 
and v' e S'. Moreover each hyperedge with all vertices in 5 U S" appears independently 
with probability p^ — 0{lnn/n^) and generates two edges between S and S'. Therefore 




=0(1). 
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Similarly 



Pr<j35cv e(^)>5|S| 

|5|<nT 



s=l ^ 



2s 



= o(l). 

Therefore with high probability for any set 5* (1 < 15*1 < n''') of vertices of degree at least 
4fc + 15 in G2 {n,p2) U G3 (^,^3) we have 

\Ns{S)\>2k\S\. 

Otherwise for S' = NsiS) there would be |^'| = \N3{S)\ < 2k and e{S,N3{S)) + 2e{S) 
would exceed (4(A; + 1) + 10)|^|. 

(iv) The probability that in G2 {n,p2) UGs {rijp^) there are two vertices of degree at most 
C at distance at most 5 is at most 
4 

' c L'V2J //„-l-^^ 
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j=0 jr=0 

= 0(l)n(lnn)^e-'"(^^+tP3) = ^(l). 

(v) Follows by Theorem 8.1 from [1]. □ 
Lemma 5. Let k be a positive integer 

Inn + (A; — 1) Inlnn + c„ 

P2 + 7tP3 - 



2 n 
(i) If k = 1 then 

Ft {5iG2 in,p2) U G3 in,ps)) > 1} ^ /(c„ 

(a) //c„ — )■ 00 then 

Ft {6iG2 {n,p2) U G3 (^,^3)) > A;} 1. 



Proof, (i) Follows by Theorem 3.10 from [T6] . 

(ii) Let Xf be the number of vertices of degree < t < k — 1 m. G2 {n,p2) U G3 {n.p^). 
Then 

= 0(l)n(lnn)* exp(— Inn + {k — 1) In Inn + c„) = o(l). 
Thus with high probability = for alH < A; — 1. □ 
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Lemma 6. Let k be a positive integer, On -^6,6'^^ 0' and 6,6' > |. Let Q{n) be a 
random graph such that for 

^ n ^ 6^^ In n 6' Ann 

(21) p2 + -P3 = and p2 = 

2 n n 

we have 



(22) G2 (n, P2) U G3 (n, ps) G(n) 

and in the probability space of the coupling with high probability all vertices of degree at 
most Ak+li in G2 {n,p2)UG3 {n,p3) are at distance at least 6 in G{n) . If Pi {6 {G{n) > k)} 
is bounded away from zero by a constant then 

(23) Pr {G{n) G Ck\6{G{n)) > k} ^ 1, 

(24) Pr{G(2n) G VM\6{G{2n)) > 1} ^ 1. 

In particular, if we substitute G{n) = G2 {n,p2) U G3 {n,p3) then by Lemmas |5] and [6] 
we obtain the following result. 

Lemma 7. Let p2 and p^ fulfil f l2T]) . 
IfP2 + fPs = (Inn + Cn)/n then 

Pr{G2 (n,p2) UG3 {n,p3) E Ci} /(c„); 
Pr{G2(2n,p2(2n)) UG3(2n,p3(2n)) G PA^} ^ /(c2„), 

where /(■) zs defined by 

If P2 + fp3 = (Inn + (A; — 1) In Inn + c„)/n and — )■ 00 t/ien 

Pr {G2 {n,p2) U 6-3 (n,p3) G C^} ^ 1. 

Proof of LemmalBi Denote by G{n)s>k a graph G(n) under condition 5(G(n)) > k. If 
Pr{5(G(n) > A;)} is bounded away from zero and G{n) has certain property with high 
probability, then also G{n)s>k has this property with high probability. 

From now on we assume that G2{n,p2) U G^{n,p^) and G{n) are defined on the 
same probability space existing by fl22p . We call a vertex f G V small if its degree in 
G2 {n, P2) U G3 (n, P3) is at most Ak + 14. Otherwise we call a vertex large. Let S" C V and 
\S\ <n^. Denote by and S~ the subset of large and small vertices of S, respectively. 
Then by Lemma Hl^iii) with high probability 

\N^in)s>_,{S^)\>mS^)\>2k\S^\. 

Moreover in G{n)s>k all vertices in S~ have degree at least k and with high probability 
are at distance at least 6. Therefore with high probability in G{n)s>k no two vertices in 
are connected by an edge or have a common neighbour and at most 15*^1 of them have 
neighbours in N^{S^) U (otherwise they would be connected by a path of length at 
most 5). Thus with high probability 

V5cv,i<|5|<n.|iVG(n),>,(5)| > |iVG(„),>, (^+) | + fcmax{|S-| - |^+|,0} > k\S\. 
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If we combine this with Lemma Hl^i) and (ii) we get that with high probabihty 
(25) 

iVG(n),>,(^) > k for all 5 C V, 1 < |^| < 

(26) 

NQ,(n)s>k ("S*) > minds'!, 4nlnlnn/ Inn} > minjlS*!, a{Q{n) s>k)} for all S* C V, 1 < |S| < 
(27) 

NG{n)s>,{S) > 2\S\ for all S C V, 1 < \S\ < 

Finally f l25]) follows immediately by f l2^ . Moreover if f l2B]) is fulfilled then G{n)s>k has a 
perfect matching (for the proof see for example [3]). Therefore (12^ follows. fl27|) will be 
used later to establish threshold function for a Hamilton cycle. □ 

Lemma 8. Let k be a positive integer, On — t- 6 , — t- 6' , 9,9' > ^ and 

n„ 9n\nn ^ 9' Inn ,^ 512 

P2 + -P-i = ' P2 = and Pi = . 

2 n n n 

Let moreover G{n) be a random graph such that: 
(i) G2 {n,p2) U Gs {n,p^) ^i_o(i) G(n); 
(ii) with high probability 5{Q{n)) > 2; 

(Hi) in a probability space existing by (i) all vertices of degree at most 22 in G2 {n,p2) U 
G3 {n,p^) are at distance at least 6 in Q{n). 

Then 

Pr {Gin) U G2 in, pi) G HC} 1. 
In particular if p2 + fps = (Inn + In Inn + Cn)/n and — i- 00 then 

Pr {G2 (n, P2) U Gg (n, pg) eUC}^!. 

Proof. We will follow the lines of the proof of Theorem 8.9 from [3j. Let t = 8n,/lnn and 
P4,o = 641nn/n^. Then tp^^ = p^. For any graph G let 1{G) be the length of the longest 
path and 1{G) = n if G has a Hamilton cycle. We say that G has property Q if 

G is connected and |A/'g(S)| > 2|^|, for all S C V, |^| < n/4. 

In the proof we assume that G2 (n,p2) UG3 (n,p3) and G(n) are defined on the probability 
space of the coupling existing by (i). Let Go = G(n) and 

Gj = Gj_i U G2 {n,pifi) , for 1 < 2 < t. 

By Lemma [6l assumptions (i)-(iii), and (p7|) with high probability G(n) has property Q. 
Moreover by Lemma |4](v) with high probability /(Go) > 1 — (8 In 2/ Inn). It is shown in 
[1] that 

Pr{Z(Gi) = n-t + i-l|Z(Gi_i) = n - t + i - 1 and Gi_i has Q} < (l-p4,o)"'^^^ < n'^ 
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Thus 

Pr {/(Gt) =n]>l--^- o(l) = 1 - o(l). 



71^ 



Since 

Gt ^ G(n) U G2 (n, tp^fi) , 
this finishes the proof. □ 

6 Sharp thresholds 

Proof of Theoremsl^^ First let 5*1 = n(\nn + c„) and p2 and ps be given by ([2]). Note 
that P2 > Inn for some constant e > 0. Then by Theorem [1] and Lemma [7] 

hm Pi {g{n,m,p) G > fim Pr {^2 (^,^2) U G3 (n,p3) G Ci} = /(c„) 

and by Lemma [U 

hm Pr {Q (n, m,p) G Ci} < hm Pr {6{Q (n, m,p)) > 1} = f{cn)- 

This imphes Theorem [21^i). Remaining theorems foUow by Theorem [T] and the following 
results: Theorem [2t^ii) by Lemma [7] and Lemma [2l Theorem [3] by Lemma [7] and Lemma [U 
Theorem H] by Lemma [8] and Lemma [2j □ 

In the following proofs we will assume that c„ = O(lnn). In the other cases theorems 
follow by Lemma [1] or ([3]) combined with known results concerning G2 {n,p). 

Proof of Theoreml5[ Let c„ — i- —00, then by Lemma [3] 

Pr {g (n, m, p) G UC} < Pr {6{g {n, m,p))>2} ^ 0. 
Let now c„ — )■ 00. First consider the case m = Q{n/ Inn). Let moreover 

m = m 1 + and m = m . 



Inn J Inn 
Then for p given by ([6]) 

m'p(l - (1 - = Inn + Inmaxjl, "^Inn j _ ^ ^ 

1 — e~"P J 

and 

nm'p^ = O(lnn) for np = 0(1), 

nm'p^ = — irnpY = O(ln'^n) for np 00. 

m' 
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Let — i- oo. Define 

^„^p(l_(l_p)n-l)_3^,p( l-(l-2p)" _(i_p).-l-)_^yi^ ^ /l-(l-2pr 



P2 



P3 



P4 



m'p(l-(l-p)"" 



n 

n — 1 \ , , / In n 



for nm'p ^ ^2np^'' (■'- ~ P)"" ^ In n; 



otlierwise 



, for nm pi — 2ri (1— p) \^ynuin] 

"6" ^ ^ 

0, otlierwise ; 

// /i l-(l-2p)"\ 



^ 512 



n n 
Let now G(n) = Q {n,m',p). Tlien by Tlieorem[T] 

(28) G2 (n,p2) U G3 {n,p3) ^i-o(i) ^ {n,m',p) = G(n); 

G2 (n, P4) ^i_o(i) ^ P) • 

Moreover by Lemma [3] wit li liigli probability 6{G{n)) > 2. Therefore assumptions (i) and 
(ii) in Lemma M are fulfilled. 

We are left with proving (iii). Let C be a positive integer. We will show that in the 
probability space of the coupling fl28l) with high probability any two vertices of degree at 
most C in G2 (n,p2)U(j3 (^,^3) are at distance at least 6 in ^ {n, m',p) = G(n). Therefore 
we need to study the couplings described in the proof of Theorem [H Recall that 



G2 {n,p2) U G3 (r2,p3) 



n,Po (^(^2) ln(l U G,3 (n,Po (^(^^ Hi ~ ps)-' 

^« G.2 (n, Po (^^l^l^l^^!^^ ^ u G.3 (n, Po (53 - 2c.' 

,(n) ^ f Si - 35*3 - 4:Uj'y/S~i\ , ,^ f r, / fET 

^i_o(i) G*2 \n, I U G=,3 [n, S3-UJ y^Si 



^S(i) G,2(n,M2)UG,3(n,M3) = |J {G,2 (n,^^) U {n, Z,)) 

l<i<m' 

^^'"^ g{n,m',p) 

for some u' = Q{uj). 

First we will show that in the probability space of the above couplings with high 
probability the degree of a vertex f G V in G2 {n,p2) U G3 (^,^3) is well approximated by 
the number of those features contained in W{v), which are chosen by at least two vertices 
(i.e. which contribute to at least one edge in Q {n,m',p)). For any f e V in ^ {n,m',p) 
let 

W'{v) = {w^ e W{v) : > 2}, 

where V/ is defined in ([9]). Denote by Av event that while constructing G=i,2 (n, M2) U 
G*3 (n, M3) in less than |iy(t>)| — 1 draws we choose an edge or hyperedge containing 
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V. Given v & V and 1 < i < m' . Denote by A^^i^i event that v is an isolated vertex in 
G=k2 {n, {Yi — 3Zi)/2) U G=k3 (n, Zi) and Av,i,2 event that v G VI . Then 

A c |J(A,*,i n A,*,2) n n A,j,2). 

For i 7^ j, (A,j,i n A,i,2) and (Aj,i fl ^^,2) are independent and Pr {Av,i,i U ^,1,2} = 1, 
thus 

'm' \ _ _ _ , _2 



2 



Pr{A} < ( 2 ) (Pr{A,i,in A,i,2}) 



< 



m 
2 

m' 
2 



(Pr {A,i,i} + Pr{A,i,2}-l) 



<r'2((i-9"+p-i)"=^("'')v 



Therefore 



Pr i y A [ < n(m') V = o( 



In^n 



n 



0(1). 



Recall that random graph G=k2 (n, ■) U G=k3 (ra, ■) is constructed by making independent 
draws of edges and hyperedges in an auxiliary hypergraph. The number of draws is given 
by random variables. Couplings (i)-(iii) may relay on this construction. In the coupling 
(i)-(iii) in order to get from G,2 (n,Po ((^) ln(l -^2)"^)) UG,3 (n, Po ln(l -ps)"!)) 
a graph 6*2 {n, M2) U G*3 (n, M3) with high probabihty we make some additional draws. 
Moreover by the sharp concentration of the Poisson distribution and (fT2|) with high prob- 
ability the number of additional draws is at most KuVnlnn for some constant K. Under 
condition that the number of additional draws is bounded at most Kuy/nlnn, probability 
that an edge (hyperedge) containing v is chosen in at least 2 additional draws is at most. 



/ Kuj\/n \v^n\ / 2 ^ ^ 



V 2 / V ^ 



oiV. 



Moreover each draw (each chosen hyperedge) generates at most 2 edges incident to v. 
Concluding in the probability space of the coupling (!28|) with high probability the number 
of edges incident to f in (72 (^5 P2) U 6*3 (^, P3) is at least W{v) — 2 — 2. Therefore if degree 
of V in Gi {n,p2) U G3 {n,p^) is at most C then W'{v) < C + 4. 
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Finally, probability that there are two vertices such that |H^'(f)| < C + 4 and 
|Vr'(f')| < C + 4 connected by a path of length at most 5 in ^ (n, m',p) is at most 

= 0(l)n(lnn)°(^) exp (-2m'j9(l - (1 - pT'^)) = o(l). 
Therefore by Lemma M with high probability 

Gin) U (^n, G HC. 

Thus in the case m = Q[n/lnn) the theorem follows by fl28|) and a straight forward 
coupling of random intersection graphs. 

G(n) U G2 (^n, ^i-o(i) G {n, m , p) U Q {n, m" , p) ^ Q (n, m, p) 

Consider now the case m = oinj Inn). Note that in this case ^ implies 

In n + c' 



P 



m 



for some — t- 00. Let m! be such that m divides vn! and m' ~ n/lnn. Take now an 
instance of Q{n,m' ,p') with the set of features W of size m' and 

, In + c' 
P = r^- 

Divide W into m groups of m'/m features. Denote by Si the set of vertices which 
have chosen features from the i-th, 1 < i < m, group in Q{n,m\p'). \Si\ has binomial 
distribution Bin {n,p") for some p" < ^p' = p. Now take an instance of Q{n,m',p') and 
construct the sets Vi, 1 < i < m, in Q (n, m,p) by taking Si and adding to Vi each vertex 
from V \Si independently with probability [p — p") /(I — p")- This coupling implies 

Q (n, m', p') :< Q {n, m, p) . 

From the considerations concerning the first case we have that with high probability 
Q{n,m' ,p') G 1-LC. Therefore also with high probability Q {n,m,p) G TiC. □ 

Proof of Theorem The technique of the proof is analogous to this of the proff of The- 
orem [51 We consider two cases m = Q{n/lnn) and m = o{n/lnn). In the first case 
the proof relies on Lemma [3] and Lemma [61 Moreover we have to use the fact shown 
in the proof of Theorem [5l that with high probability in the coupling constructed to 
prove Theorem [H with high probability the vertices of degree bounded by a constant in 
^2(^5^2) U G^lrijPs) are at distance at least 6 in Q {n,m,p). The proof of the case 
m = o{n/ Inn) is the same as in the proof of Theorem [51 □ 
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